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STANDARD SCORES IN ARITHMETIC 



S. A. COURTIS 
Detroit Home and Day School 



For the attainment of efficiency in any manufacturing process 
three things are necessary: a well-defined, a "standard," product; 
a knowledge of the properties and possibilities of the material to be 
acted upon; and a means of measuring the effects produced by the 
process itself. Given these three, all else is but a matter of trial 
and selection. 

Education, from one point of view at least, is a manufacturing 
process. The raw material that is sent to a school is molded and 
shaped by the forces and experiences to which it is subjected. 
Marked changes are effected and the product has, in a general way, 
a commercial value that bears some relation to the complexity 
of the process and the degree of change produced. In a graded 
system promotion from grade to grade emphasizes the idea of 
"degree of change," of grade standards. Promotion from the 
fourth to the fifth grade, for instance, is dependent upon the acquisi- 
tion of a certain amount of knowledge, the attainment of a certain 
degree of skill. In an efficient school one does not expect to find 
children of fourth-grade ability doing eighth-grade work. Each 
higher grade reached is supposed to mean a correspondingly higher 
level of ability. 

The attempts of the writer to measure the efficiency of the teach- 
ing and the growth of the arithmetic classes under his control in the 
Liggett School have been described in some detail in previous num- 
bers of this Journal. That work gave some knowledge of the nature 
of the material to be acted upon in school work, and proved that it 
was possible to measure accurately the changes produced by teach- 
ing effort. Clearly defined standards alone were lacking. The 
present article will deal with an attempt to determine such standards 
from the measurement of a large number of children. 

Because of the complexity of the problem, the scope of the 
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investigation was limited to the simplest work with whole numbers 
in the four fundamental operations. The tests used were in all 
essential particulars like those previously described, and will not 
be discussed in detail. It is sufficient to say that in their construc- 
tion the utmost care was taken to eliminate the imperfections of 
previous tests caused by the inequalities of the units. The sub- 
ject-matter of the tests, eight in number, was as follows : 

Test No. i. Addition \ 

Test No. 2. Subtraction [,„,.. •, 

Test No. 3. Multiplication (Combinations, o- 9 ) 

Test No. 4. Division / 

Test No. 5. Copying figures (rate of motor activity) 

Test No. 6. Speed reasoning (simple one-step problems) 

Test No. 7. Fundamentals (abstract examples in the four operations) 

Test No. 8. Reasoning (two-step problems). 

The nature of the investigation made it essential that the tests 
should be given to all grades from the lowest to the highest under 
identical conditions. To this end detailed instructions for every 
phase of the work, together with special record sheets, answer cards, 
etc., were printed in the form of folders to be put into the hands of 
each examiner or scorer. As experience had shown the most 
probable source of error to be faulty timing, the time allowances 
were printed in red. In addition a personal letter of warning was 
sent with each package of tests. It is realized that no degree of care 
can prevent either accidental errors, or deliberate fraud, but the 
testing of many schools tends to eliminate the effects of the first, 
and the character of the persons who took part in the work makes 
the second improbable. A few cases of accidental errors were re- 
ported, and for these tests either due allowance has been made or the 
results rejected entirely. The writer believes the precautions taken 
to have been effective; and that the conditions under which the 
tests were given were sufficiently uniform for the results to reflect 
accurately the actual conditions. Check tests of several grades 
under absolutely uniform conditions and with mechanical timing 
have yielded similar results. 

The co-operation of many persons was secured by letters to 
personal friends, by circulars, and by editorial notice in this and 
two other magazines. For the kindness of these notices the writer 
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is very grateful. It not only brought the matter to a wider circle 
of interested persons than could have been reached without it, 
but it also gave the investigation a standing it would have been 
hard to secure in any other way. As a result inquiries were received 
from many parts of the United States and from several foreign 
countries. The total of requests and orders for tests exceeded 
30,000 copies. Ten thousand sets of tests were distributed free 
and eight thousand more were sold at the bare cost of printing. 
Teachers in England and Germany were interested in the work 
and the tests have been, or are to be, given in those countries also. 

From the above it must be evident that the undertaking owes 
its success to its co-operative features. While the writer alone is 
responsible for whatever defects there may be in the tests themselves 
or in the planning of the work, credit is due the many who executed 
the plans and bore the burden of giving and scoring the papers. 
The writer is very glad of this opportunity to express his grateful 
appreciation of the assistance rendered by all those whose generous 
support and careful work has made this article possible. 

Fire, accidents, sickness, and unavoidable delays of many kinds 
have prevented the completion of the work by many to whom the 
tests were sent. The tabulations of this article are based upon the 
returns received from some sixty or seventy schools in ten different 
states as shown below. 

CONTRIBUTORS TO STANDARD SCORES 



State 


City 




Name 


New Hampshire 


Littleton 




D. F. Carpenter, superintendent 


Massachusetts 


Dorchester 




A. R. Winter, sub-master, Mather 
School 




Fitchburg 




E. A. Kirkpatrick, head of Depart- 




Leominster 




ment of Psychology, State 
Normal School 


New York 


New York City 


H. C. Pearson, principal of Horace 








Mann School 








Miss Margaret Eves, acting super- 








visor of Elementary Mathe- 








matics, Ethical Culture School 




Brooklyn 




J. C. Bell, Psychology, Training 
School for Teachers 




Hastings- on- 


-Hudson 


W. R. Williams, principal 
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CONTRIBUTORS TO 


STANDARD SCORES— Continued 


State 


City 


Name 


Delaware 


Dover 


A. Crawford, principal 


Maryland 


Salisbury 


W. J. Holloway, county superin- 
tendent 


Virginia 


Farmville 


C. W. Stone, Department of Edu- 
cation, State Normal School 

Miss Mary D. Pierce, supervisor in 
Practice School 




Williamsburg 


H. E. Bennett, professor of educa- 




Toano 


tion, College of William and 
Mary 


Michigan 


Detroit 


A. Cotter, principal, Harris School 
W. C. Martindale, superintendent 
Misses Liggett, principals, Liggett 
School 




Kalamazoo 


S. 0. Hartwell, superintendent 


Indiana 


Marion 


J. T. Giles, superintendent 




Fort Wayne 


Miss Flora Wilber, principal City 
Normal School 




Bloomington 


H. L. Smith, superintendent 


Missouri 


St. Joseph 


Miss Fannie Brennen, principal 
Garfield School 


Kansas 


McPherson 


G. G. Pinney, superintendent 



In the list that follows the number of tests sent to each of the 
above is given that some idea may be had of the sizes of the different 
groups of children measured. There is no correspondence, however, 
between the order of schools in the two lists. The number of 
actual individual scores returned for tabulation was a little over 
9,000 for most of the tests. 

Number of tests sent to the individuals in the list above: 
2500, 1600, 1400, 1000, 750, 650, 500, 380, 350, 350, 350, 300, 200, 200, 196, 150, 
140, 120 

Total 12,171 

A record sheet for the scores of each individual measured was 
sent out with each set of tests. These, when completed, were 
returned and will be used, later in the year, as the basis of tabula- 
tions to determine standard scores by age, and the correlation 
between the abilities measured. For each grade, a copy of the 
grade record sheet was also returned. Upon this was shown the 
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distribution of the scores of the individuals composing the grade. 
Tabulations of these distributions are the bases of the tables below. 
For instance, returns were received from fifty eighth-grade classes 
(both A and B). The distributions of the classes having the 
highest, median, and lowest grade averages are given in Table I; 
also the total of the distributions of the entire fifty classes. The 
same facts are shown graphically in Fig. 1. 



500- 




Fig. 1. — Distribution of 1,400 individual eighth-grade scores. Scale at bottom 
of figure gives number of answers written per minute in test of multiplication tables; 
scale at left gives frequencies. Solid horizontal lines show range of individual scores 
in the classes having the lowest, highest, and median grade averages of the fifty 
classes contributing the individual scores. M = median score of the entire group = 41 ; 
A = average = 43; 5 = score selected as standard for the grade =49. 

In words the meaning of the table is that of 1,401 eighth-grade 
children measured as to their knowledge of the multiplication 
tables, 2 could write answers at a rate of but 5 answers a minute, 

TABLE I 

Distribution op Individual Scores in Speed Test, Multiplication 

Three Representative Classes; also Total of Fifty Classes; Eighth Grade 





No. in 
Class 


Scores 


Highest 

Median 

Lowest 


36 
43 
13 


0-9 


IO-I9 
I 


25 

4 
6 


35 

4 

18 

5 


45 

5 

16 

1 


55 
6 

3 


65 
13 

2 


75 
4 


85 

4 


95 


i°5 


115 


Total 


1,401 


2 


17 


180 


460 


410 


191 


77 


26 


22 


5 


10 


1 
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17 at the rate of 15 answers a minute, 180 at 25 per minute, and so 
on, the average for the entire group being 43. 

It is particularly to be noted that the wide range of the distribu- 
tion is caused not so much by great differences between schools as 
by great variation in the abilities of individual children. The 
amount of variation in the grade averages of different schools 



Speed Test- Addition 




Fig. 2. — Comparison of the distribution of fifty eighth-grade averages with that 
of forty-seven individual scores in a single eighth-grade class. Solid line, individuals; 
broken line, classes. The curves show that the individuals in a single class differ 
more than the grade averages of the fifty schools. 

does not equal the variation in the scores of the members of any 
one class. Table II and Fig. 2 show this plainly. The distribu- 
tion of an average class of 47 members (selected from the 9 
classes of the 51 having a membership of from 42 to 51) is given for 
comparison with the distribution of the 51 grade averages. The 
results are from the scores in the addition speed test, for many 
schools the ability in which the least variation was shown. 

TABLE II 
Speed Test, Addition: Eighth Grade 
Comparison of Distribution of 47 Individual Scores in a Single Class of Average 
Variability with the Distribution of the Grade Averages of 51 Classes 





No. 


Scores 




47 
Si 


30-39 

1 


45 

6 

11 


55 
16 
29 


65 
13 

7 


75 
7 

2 


85 

4 

1 


95 


Classes 
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The fact shown is a constant feature of the returns for all the 
tests and grades, and can only mean that the differences in the 
abilities of individual children are greater factors in determining 
relative rank in school work than all the differences in abilities 
of teachers, courses of study, or methods of work combined. The 
result is that the grades in all the schools overlap to an extent well- 
nigh incredible, and thus the data from this more extended investiga- 
tion fully confirms the conclusions reached in a discussion of this 
point in a previous article, although in that case the data was 
derived from a single school. 

TABLE III 
Grade Averages from Total Distributions 



Grade No. 



Average 

of Scores 

for Each 

Test 



No. 1 



No. 2 No. 3 



No. 4 



No. s 



No. 6 



Ats. Rts. 



No. 7 



Ats. Rts 



No. 8 



Ats. Rts, 



1 
2 
3 

4 
5 
6 

7 
8 

9 
10 
11 
12 
r 3 
14 



55 

75 

525 

1,222 

i,i77 
1,282 

i,432 

J,37o 

412 

216 

151 
169 
462 
131 



19 

25 
32 
37 
40 

45 
47 
45 
47 
48 

56 
5i 



10 
16 
23 

3° 

34 
38 
43 
45 
43 
44 
44 
5° 
58 



28 

35 
38 
44 
47 
46 
48 
49 
56 
59 



102 
108 
112 
114 
112 
116 
124 



2.8 
3-7 
4-4 
5-i 
5-9 
6.8 
6.9 
7.2 
7-9 



2 . 1 

2-5 

3-4 
4-4 



7-7 
8.6 

9-7 



5- 

6. 

9 

10. 
5-2 11. 
6.1J13, 
6.413, 
6.7,14. 
7-4 14 
7-2:i4, 
8.2 16, 
9.1 17.2 



8,679* 



1.7 

3-6 

5-3 

6.9 

7-6 

8.9 

95 

9-5 

9-4 

10.8 

12.6 

11. 8 



2.7 
2.6 
2.8 
3-4 
3-7 
4-i 
4-i 
4-i 
4-5 
4.6 

5-3 

5-4 



0.6 

0.8 
1 .2 

i-7 
2.2 
2.7 
3-i 
3-i 
i-3 
3-6 
4.0 
4.1 



* Certain of the tests were omitted by several schools, thus materially reducing the average. For 
most of the tests, however, the total number of scores is over 9,000. 

From the discussion above it is evident that the average for any 
grade represents simply a central tendency from which individuals 
vary widely. Yet in many ways the average scores of each group, 
given in Table III, and shown graphically in Fig. 3, reveal the char- 
acter of the development of each of the abilities through the school 
and show what should be true of the development of the individual. 
These data, then, in connection with the distributions from which 
the averages are derived furnish the desired basis for the selection 
of standard scores. 



134 



THE ELEMENTARY SCHOOL TEACHER 



\ 


\ 


V 








\ 










\ 










\ 










I 










\ 














\ 










V 






i 




\ 




T3 






\ 












. 


. 







\ 


\ 










I 


\ 










\ 




























\ 










\ 












\ 




> 








\ 


w 


\ 






* 




\ 






c 


% 
s 












i. 





















H ° a > ~ « ^ - " 

55 °— "-s -■= s-S"c 

a, • * 3 



?f- - : 



: 5 ■ -5 t e I 

is £ r "II 






■+-> O 



•a 
■> 



12 










U 










H 










3 










5 .S! 










5 E 










(O < 










to 


C 








S 

D _. 
O 
w - 


I 








a 








£ 



















c 


# »■ 


SI 






5 






\ 












1 




































\ 












\ 














\ 












\ 












\ 






I 


> 




> 


\ 






2 












. 




. 


, 















" 










I 










£ 


— Ll. 
\ I 
\ \ 












V 

\ 






c 




\ V 
\ V 






« i 






l 










L \ 




T5 






\ 1 




(/I 





















- 9 

K 




c \ 

'-5 \ 

"0 \ 



5 


* I 


• i 


J 


? * 












D 




























































* — 










Li 

3 














\ 




. 


A- 



u 






































































a 
M 














C 














a 






























TJ 














a 





























T3 




o o 

**- o 

s; s 

3 2 

o 

a §* 

^ w 

— d) 
T3 > 

•s-s 

... C3 

1^ 
Sj 

of OT 



o 

CO 



to g 

§ i 

.8 > 

g T3 



o 



£ 


4~) 
U 




In 




ert 




^ 



STANDARD SCORES IN ARITHMETIC 



135 



This table means when put in words that in the third grade, for 
example, there were 525 individual scores in each test — or about 
525. If in some test there were more, as for example, addition, 
there were enough less in division to make the average number of 
tests 525. In the addition tests it was found that the standard or 
average ability was that exhibited by a child who could record 26 
in a minute. The standard in subtraction was 19 per minute, and 
so on. 

The writer regrets exceedingly that there is not space to print 
the distributions in full; for the selection of standard scores must 
for the present be a matter of individual judgment. Many will 
be content to use the average scores as standards, and if, indeed, 
no children in any grade had a lower score than the present average 
for that grade, the increase in the efficiency of our schools in these 
abilities would be very great. But the writer feels that a higher 
score than the average is desirable for a standard and the scores 

TABLE IV 
Standard Scores 



Test No. 


No. 1 


No. 2 


Nos. 3 

AND 4 


No. s 


No. 6 


No. 7 


No. 8 


Ats. 


Rts. 


Ats. 


Rts. 


Ats. 


Rts. 


Grade 3 


26 
34 
42 
S° 
58 
63 
65 


19 

25 

31 

38 

44 
49 
SO 


16 
23 
30 

37 
44 
49 
So 


58 
72 
86 

99 
no 
117 
120 




2.1 
3-0 
4.0 
5-0 
6.0 
7.0 
7-8 


5-o 
7.0 
9.0 
11. 
i3-o 
14.4 
i5-o 


2.7 
3-3 
4-9 
6.6 

8-3 
10. 
n. 


2.0 

2.6 

3-1 
3-7 

4-2 

4.8 
5-o 






3 

4 
S 
6 

7 

8 


7 
8 
8 
8 
8 


i-7 


Grade 5 




2 8 


Grade 7 


3-4 
4.0 

4-3 


Grade 8 


Grade 9 









given in Table IV were derived as follows : For each test an eighth- 
grade score was selected such that it was equalled or exceeded by 
thirty per cent of the eighth-grade children measured. This score 
was plotted and a smooth curve drawn, having the same general 
form as the average curve and coinciding with it in the lower grades. 
The scores for each of the other grades were then determined from 
the graph. 

While the results given above were obtained by the use of special 
tests, and while for exact comparison the reproduction of the 
identical tests and the identical conditions are necessary, the 
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writer hopes that teachers generally will appreciate the fact that 
the scores above furnish definite objective standards toward which 
to work. Accurate measurements are needed twice a year; once 
at the beginning for purpose of diagnosis, and again at the close 
of the year to measure the effect produced by the teaching effort. 
But for general class work, approximate values are all that are re- 
quired. The scores in Table IV should be translated into words: 
thus, at the end of a year's work, an eighth-grade child should be 
able to copy figures in pencil on paper at the rate of 117 figures a 
minute; to write answers to the multiplication combinations at 
the rate of 49 answers per minute; to read simple one-step problems 
of approximately 30 words in length and decide upon the operation 
to be used in their solution at the rate of 8 examples a minute with 
an accuracy of 90 per cent; to work abstract examples of approxi- 
mately 10 figures (twice as many for addition) at the rate of 14.4 
examples in 12 minutes with an accuracy of 70 per cent; to 
solve two-step problems of approximately 10 figures at the rate of 
5 in 6 minutes with an accuracy of 75 per cent. At the present 
time 70 per cent of the eighth-grade children cannot meet these 
standards. But it must be borne in mind that 3 per cent of the 
fifth-grade children can, and that experience has shown that indi- 
vidual care and a very little, well-managed drill produces marked 
changes in the abilities of most children. 

A word of caution is necessary for those who use these standards. 
The writer is well aware of the disfavor into which drill has fallen, 
and he is heartily in sympathy with the ideas and principles which 
lead to its disuse. On the other hand, no fair-minded person can 
consider the wide range of variation in all the classes and not realize 
that something is wrong. It may well be that the pendulum has 
swung too far. Inordinate drill is surely harmful. Drill out of place 
is also harmful. But drill in itsproper place — after, and not before, un- 
derstanding control — and in proper amount — enough to insure the 
minimum ability necessary to effective work but not enough to in- 
volve waste of teaching effort — such drill is one of the vital factors in 
efficient teaching. The scores above, while necessarily tentative only, 
have been selected with due regard for such critical points in the de- 
velopment of these abilities as have been determined. It is probable 
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that development much beyond the points indicated by the standard 
scores — and such development is possible— would involve waste of 
effort and, if not positively harmful, would be useless from the stand- 
point of efficient, general development. Accurate determination 
of the proper amount and form of drill required to produce the 
standard abilities, and similar determinations of standard, yearly 
growths, are the next problems that will be attacked. For the 
writer considers that the present investigation has proved that 
co-operative work on such educational problems is both feasible 
and of value. 

The writer cannot bring this discussion to a close without a 
plea for both the teacher and the child. When a child's score 
exceeds the standard for his grade, he will ordinarily need little 
further attention. The ordinary procedure in grade work furnishes 
him adequate opportunities and incentives. But when the score is 
below the standard the case is entirely different. The fault may 
be in the child. A defective mentality may render the development 
of a given ability absolutely out of the question. But in such 
cases it would seem the height of folly and a rank injustice to give 
a teacher a fourth-grade intellect upon which to work and expect 
him to secure eighth-grade achievement. Or the child may be 
normal, but some peculiarity of its mental equipment causes a 
marked bias in one direction or another. The child that learns 
addition readily, but falls far below the standard in multiplication, 
needs special care and special work until the standard ability in 
multiplication is attained. A balanced, many sided development 
is the right of all. It is in such cases that standard scores are of 
the greatest service. On the other hand, the fault may be in the 
teaching, the course of study, or the methods of work. But here 
again the number of children who reach, or fail to reach, the stand- 
ard is the measure of efficiency. From every point of view, there- 
fore, standardization of all school subjects, and the use of standard 
scores offer a possible escape from present, unsatisfactory condi- 
tions. 



